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TRANSCODING IN DATA COMMUNICATIONS 



This invention relates generally to transcoding in data communications systems. More 
particularly, embodiments of the invention provide transcoders and transcoding methods, and 
5 apparatus and methods for use with or incorporating such transcoding systems. 

Data access and manipulation devices proliferate in many different forms, with widely 
varying input, output and processing capabilities. This creates difficulties in providing general 
purpose access to centralized processing and database systems, since such a large range of 
devices must be accommodated. Typically such centralized systems are primarily designed to be 

10 accessed by powerful devices such as PCs which have sophisticated processing and I/O 

capabilities compared to many data manipulation devices currently in use. As an example of the 
problem, it is becoming increasingly desirable to provide users with mobile access to centralized 
systems over the Internet via portable devices such as small, hand-held computing devices, 
pagers and mobile phones: In the case of the Internet, for example, the vast majority of users 

15 have access via browsers running on powerful computing devices such as PCs with relatively 
high-speed, high bandwidth communications links, and the nature of the data that can be 
retrieved from the Internet, in terms of the structure, variety and complexity of its content, has 
developed with such powerful devices in mind. However, the expectations of the data handling 
capabilities of the recipient device far exceed the capabilities of many portable access devices, 

20 which may have slow communications links, limited processing power and unsophisticated 
display hardware. 

To accommodate such a range of data access devices and allow them access to 
centralized systems, modem data delivery chains incorporate devices known as "transcoders**. A 
transcoder processes generically formatted data content in a message received from a source 

25 device such as a server to produce a device-specific data message adapted to the capabilities of 
the intended destination device. Common tasks that a transcoder might perform include the 
removal of non-essential data, conversion between different data formats, data compression or 
decompression, and general processing of data content to simplify the resulting message. In 
simple terms, however, transcoder operations can be categorized as one of three main types of 

30 operation, namely: omitting data, whereby certain data is removed from the received message; 
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maintaining data, whereby certain data in the received message is maintained without change; or 
modifying data, whereby certain data in the received message is changed in some way, eg. by 
altering the existing data through processing, or replacing the existing data with new data. In this 
context, it will be understood that the "message" on which the transcoder operates may be any 

5 type of data communication to be delivered from a source device to a destination device, from a 
simple document to a complex communication with textual, graphics, audio or visual content. 

Incorporating the transcoding function into source or destination devices is impractical 
for all but a few highly security-sensitive applications due to the additional software and 
hardware requirements and the consequent cost implications, particularly as data access devices 

10 and transcoder functionality evolve quite rapidly. External transcoder services, provided for 
example by portable device manufacturers, network operators or ISPs, offer a more practical 
solution. In such cases in particular, however, the question of security arises. Specifically, the 
"verifiability" of the transcoder action, ie. the ability of the end user to verify that the message 
content has not been unacceptably or maliciously altered in the transcoding process, becomes a 

15 concern. Common cryptographic facilities, such as "message hashing", can provide verification 
that a message has not been altered during transit, but transcoders need to alter messages in order 
to accomplish their task. While some of the alterations may be legitimate, others could be 
malicious. As a highly simplistic example, consider that the following message is received by a 
transcoder from an origin server: 

20 Original message: Do you wish to transfer $ 10 from account A to account B? 

For a destination device with limited output capability, a transcoder may generate View 1 as 
follows: 

View 1: transfer $10 from A to B? 
Alternatively, the message might be altered to View 2 as follows: 
25 View 2: transfer $ 100 from B to A? 

Clearly View 1 is a legitimate rendition of the original message whereas View 2 is a malicious, 
unacceptable rendition. 

It should be evident that an automatic method for verifying the semantic content of a 
message against the original is not feasible. For example, another possible rendition of the above 
30 message is View 3 as follows: 
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View 3: credit $10 from B to A? 
This is a legitimate rendition of the original message, but it is infeasible to verify automatically 
that the meaning of "credit" here is equivalent to the meaning of "transfer** in the original 
message. 

5 It will be apparent from the above that an efficient system allowing verification of 

transcoder action to the extent feasible would be of significant advantage in data 
communications systems where transcoding is required. 

According to one aspect of the present invention there is provided a method of 
transcoding a data message, comprising a plurality of data fields and an authentication code, to 
10 produce a transcoded message for transmission to a destination device, the data message being 
received from a source device wherein said data fields have been coded in accordance with a 
first coding system, whereby respective data field codes are generated for said data fields and a 
message code is derived from said data field codes, and wherein said message code has been 
coded in accordance with a second coding system to generate said authentication code, the 
1 5 method comprising: 

determining for each data field of the received data message whether to maintain, modify 
or omit that field; 

for a field to be maintained, maintaining that field in said transcoded message; 
for a field to be omitted, coding the field in accordance with said first coding system to 
20 generate an omitted field code dependent upon the data field code for that field, and replacing 
that field by said omitted field code in the transcoded message; and 

for a field to be modified, coding that field in accordance with said first coding system to 
generate a modified field code dependent upon the data field code for that field, and replacing 
that field by a modified field, comprising modified data and said modified field code, in the 
25 transcoded message; and 

including said authentication code in the transcoded message. 

In transcoding methods embodying the present invention, therefore, on receipt of a data 
message as defined above, it is determined in the usual way whether to maintain, modify or omit 
each data field of the received message, and the transcoded message is produced accordingly. In 
30 particular, a field to be omitted is replaced by an omitted field code in the transcoded message, 



and a field to be modified is replaced in the transcoded message by a modified field comprising 
the modified data and a modified field code. Generation of an omitted or modified field code 
involves at least the step of coding the original field in accordance with said first coding system, 
whereby the resulting omitted or modified field code is dependent on the data field code for that 

5 field. The resulting transcoded message comprises maintained data fields, omitted field codes, 
modified fields and the authentication code from the received message. Since the omitted and 
modified field codes are dependent on the original data field codes generated by the source 
device, a destination device to which the first coding system is known can derive from the 
transcoded message all the information it needs to regenerate the message code. Assuming the 

10 second coding system is also known to the destination device, the destination device can then 
verify that the message code derived from the transcoded message corresponds to the message 
code encoded in the authentication code by the source device. Thus, with transcoding methods 
embodying the invention, sufficient information is included in the transcoded message to enable 
an appropriately preconfigured destination device to regenerate and verify the message code 

15 against the message code authenticated by the source device. Moreover, the transcoding system 
is such that the destination device can identify the nature of the transcoding operations 
performed on the original message, and this information can be used to further advantage by the 
destination device, for example by displaying this information to the user. Embodiments of the 
invention therefore provide an efficient and practical system giving a high degree of verifiability 

20 of transcoder operation. 

In preferred embodiments, for the sake of simplicity, the omitted field code for a field is 
simply the data field code for that field, at least for some instances of a field to be omitted. Other 
systems can be envisaged, however, in which the omitted field code is otherwise related to the 
data field code, for example by further processing the data field code in some way to obtain an 

25 omitted field code from which the data field code can be derived by the destination device. 

Where derivation of the message code in the source device involves coding the data field codes 
for predetermined groups of fields to generate respective group codes, then in preferred 
embodiments, for a field to be omitted: if all fields in the corresponding group are to be omitted, 
then the omitted field code comprises the group code for that group, and the group of fields is 

30 replaced by said group code in the transcoded message; and if less than all fields in the 
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corresponding group are to be omitted, then the omitted field code comprises the data field code 
for that field. Use of the group code to replace a group of fields in this way enables the resulting 
transcoded message to be simplified. An example of a type of coding system which may be 
employed as the said first coding system, and to which this method may be applied, is a hashing 
5 algorithm. In this case, the data field codes may be hash values calculated from the original data 
fields, and a "hash tree" may be calculated over these data field codes such that the 
aforementioned group codes are the hash values of parent nodes of the hash tree. This will be 
described in more detail below. 

In some embodiments, the modified field code for a field could simply be the data field 

10 code for the received field. However, in preferred embodiments, when a received field is to be 
modified to produce a modified field including modified data, the modified field code is 
obtained by generating the data field code for the received field, coding the modified data in 
accordance with the first coding system to generate a modified data code, and representing the 
difference between that data field code and modified data code in the modified field code. The 

15 modified field code may represent the aforementioned difference in a number of ways. For 
example, the modified data code may simply be subtracted from the data field code to generate 
the modified field code. Alternatively, for example, an exclusive-OR operation may be applied 
to the data field code and modified data code to generate the modified field code. Other such 
reversible operations will be apparent to those skilled in the art, the point being that the 

20 destination device can regenerate the data field code from the modified data and the modified 
field code. Making the modified field code dependent on both the data field code and the 
modified data in the modified field provides an additional level of security, since malicious 
replacement of the modified data after the transcoding process would result in an erroneous data 
field code being derived at the destination device, and hence failure of the authentication 

25 process. 

The present invention also provides a method of processing a data message, comprising a 
plurality of data fields and an authentication code, received from a source device wherein said 
data fields have been coded in accordance with a first coding system, whereby respective data 
field codes are generated for said data fields and a message code is derived from said data field 
30 codes, and wherein said message code has been coded in accordance with a second coding 
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system to generate said authentication code, the method comprising transcoding the received 
data message by a transcoding method as described above, transmitting the transcoded message 
to the destination device, and, in said destination device: 

deriving a received message code from the transcoded message using maintained fields, 
5 modified fields and omitted field codes in said message in accordance with said first coding 
system; 

comparing the received message code with the message code encoded in said 
authentication code in accordance with said second coding system; and 

displaying a user message dependent upon the result of the message code comparison. 
10 If the received message code does not tally with the authentication code, the resulting 

user message could simply indicate that the received message was invalid, and not display the 
message itself. However, at least if the received message code is identical to that encoded in the 
authentication code, the relevant content of the transcoded message, ie. at least the maintained 
data fields and modified data, can be displayed in the user message. The user message may also 
15 explicitly indicate that the message has been authenticated. In addition, in preferred 

embodiments the user message includes transcode indicators indicative of the location in the 
displayed message of fields omitted or modified in the transcoding process. This allows the user 
to make a personal assessment of whether the message should be relied upon. Moreover, in 
preferred embodiments where the original message sent by the source device is stored as part of 
20 the transcoding process, provision can be made for the destination device to request omitted 
fields, or the original content of modified fields, from the transcoder in response to a user input. 
These original fields can then be displayed to the user. 

In general, where features are described herein with reference to a method of the 
invention, corresponding features may be provided in accordance with apparatus of the 
25 invention, and vice versa. Thus, for example, a further aspect of the present invention provides a 
transcoder for transcoding a data message, comprising a plurality of data fields and an 
authentication code, to produce a transcoded message for transmission to a destination device, 
the data message being received from a source device wherein said data fields have been coded 
in accordance with a first coding system, whereby respective data field codes are generated for 
30 said data fields and a message code is derived from said data field codes, and wherein said 




message code has been coded in accordance with a second coding system to generate said 
authentication code, the transcoder comprising: 

a memory for storing the received data message; 



transcoder logic configured to determine for each data field of the received data message 
5 whether to maintain, modify or omit that field, and to produce the transcoded message from the 
received data message; and 

means for transmitting the transcoded message to the destination device; 

wherein the transcoder logic is configured to produce the transcoded message from the 
received data message by: 
10 for a field to be maintained, maintaining that field in said transcoded message; 

for a field to be omitted, coding the field in accordance with said first coding system to 
generate an omitted field code dependent upon the data field code for that field, and replacing 
that field by said omitted field code in the transcoded message; 

for a field to be modified, coding that field in accordance with said first coding system to 
15 generate a modified field code dependent upon the data field code for that field, and replacing 
that field by a modified field, comprising modified data and said modified field code, in the 
transcoded message; and 

including said authentication code in the transcoded message. 

Another aspect of the present invention provides a destination device for receiving a 
20 transcoded message from a transcoder as defined above, the device comprising a memory for 
storing a received transcoded message, a display, and control logic configured to: 

derive a received message code from the transcoded message using maintained fields, 
modified fields and omitted field codes in said message in accordance with said first coding 
system; 

25 compare the received message code with the message code encoded in said 

authentication code in accordance with said second coding system; and 

to supply a user message, dependent upon the result of the message code comparison, to 
the display for display to a user. 

The invention further extends to a data delivery system comprising such a transcoder and 
30 one or more such destination devices. Moreover, the invention extends to a data communication 
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system comprising such a data delivery system and a source device for generating a data 
message for transmission to a said destination device, the source device comprising message 
processing logic configured to: 

divide data to be included in the data message into a plurality of data fields; 
5 code said data fields in accordance with said first coding system whereby respective data 

field codes are generated for said data fields and a message code is derived from said data field 
codes; 

and to code said message code in accordance with said second coding system to generate 
an authentication code for the message; 
10 the source device including means for transmitting a data message, comprising said 

plurality of data fields and said authentication code, to the transcoder of said data delivery 
system. 

The message code could be derived in various ways from the field codes depending on 
the particular nature of the first coding system employed. For example, the message code could 

15 be obtained by processing one or selected field codes, but preferably all field codes are used in 
derivation of the message code for the sake of security. While the message code could simply be 
the group of field codes collectively, in preferred embodiments the field codes are further 
processed to generate the message code. Similarly, while various encryption algorithms might be 
employed as said second coding system, a signing function can conveniently be used here. 

20 Preferred embodiments of the invention will now be described, by way of example, with 

reference to the accompanying drawings in which: 

Figure 1 is a schematic illustration of a data communication system embodying the 

invention; 

Figure 2 illustrates the principle of a hashing algorithm employed as the first coding 
25 system in the embodiment of Figure 1 ; 

Figure 3 is a schematic illustration of a simple data message format which may be 
employed in embodiments of the invention; 

Figure 4 is a flow chart describing operation of the transcoder in the system of Figure 1 
on receipt of a data message; and 



H 



Figure 5 is a flow chart describing operation of the destination device in Figure 1 on 
receipt of a transcoded message. 

Figure 1 shows the main elements of a data delivery chain in a data communication 
system embodying the invention. As illustrated, the system includes a source device 1 which can 
5 communicate with a transcoder 2 via link 3. The source device 1 may be a server for a 

centralized processing or database system, for example a network server or an Internet server, 
and link 3 may be a hard-wired or a wireless link as appropriate. The transcoder 2 can 
communicate with a destination device 4 via link 5. The destination device 4 may be, for 
example, a portable device such as a hand-held computing device, pager or mobile phone, and 
10 may be one of a number of such devices with which the transcoder can communicate. The link 5 
may again be a hard-wired or wireless link as appropriate depending on the nature of destination 
device 4. 

The elements of the source device 1, transcoder 2 and destination device 4 involved in 
operation of the data communication method to be described below are illustrated schematically 

15 in the figure. In particular, the source device 1 includes processing logic in the form of message 
processor 6 for generating a data message to be transmitted to destination device 4. In this 
example, the processor 6 is configured by software to formulate the data message as described in 
more detail below. Alternatively, hard-wired logic may be used to implement the message 
processing logic in some embodiments. Either way, suitable implementations will be apparent to 

20 those skilled in the art from the description herein. While message processor 6 is illustrated for 
simplicity as having an input for receiving data from which the data message is to be formulated, 
in practice the message processing logic may be integrated with other functional logic in a 
processor performing various tasks for the source device, and the data in question may be 
generated by performance of such a task. Source device 1 also includes transmitter circuitry 7 for 

25 transmitting the data message to transcoder 2 via link 3, the specific implementation of the 

transmitter circuitry 7 depending on the nature of link 3. (While transmitter circuitry 7 is shown 
for the purposes of this description, in practice, of course, receiver circuitry via which the source 
device receives incoming communications may also be provided). 

The transcoder 2 includes transceiver circuitry 8 for receiving data messages from source 

30 device 1 and communicating data with destination device 4. Again, the specific form of this 
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circuitry depends on the nature of the links 3, 5. The transcoder 2 further includes transcoder 
logic in the form of transcode controller 9 for processing a received message to generate a 
transcoded message for onward transmission to the destination device 4. Transcode controller 9 
may be implemented by a processor configured by appropriate software as will be apparent to 

5 those skilled in the art from the description herein. A memory 10 is associated with transcode 
controller 9 for temporary storage of received and transcoded messages. 

Destination device 4 includes transceiver circuitry 1 1 for communicating data with 
transcoder 2, and control logic in the form of control unit 12, again implemented by a suitably 
programmed processor, for performing the message authentication functions described below 

10 and controlling operation of the device generally. The device further includes a memory 13, 
display 14 and user input means 15, for example a keypad, via which the user can input 
instructions to the control unit as described further below. 

In operation, when a data communication is to be sent from source device 1 to 
destination device 2, the data is processed by message processor 6 as described further below to 

15 generate the data message which is transmitted to transcoder 2 via transmitter circuitry 7. The 
transcode controller 9 receives the data message from transceiver circuitry 8 and stores the 
message in memory 10. The transcode controller then determines for each field of the data 
message whether that field should be maintained, omitted or modified to produce a message in a 
form suitable for forwarding to the destination device 4, (The particular system by which the 

20 transcode controller makes this determination is not central to operation of the present invention 
and need not be discussed in detail here. Suffice to say that this process may be performed in 
various ways in accordance with known transcoding systems). The transcode controller 
generates the transcoded message from the received message as described in more detail below, 
the transcoded message being stored in memory 10. The transcoded message is then transmitted 

25 to destination device 4 via transceiver circuitry 8. At destination device 4, the transcoded 
message is received by control unit 12, via transceiver circuitry 11, and stored in memory 13. 
The control unit 10 then checks the authenticity of the received message and supplies a user 
message, dependent on the result of the authentication process, to display 14 for display to the 
user. These processes and subsequent operations available to the user will be described further 

30 below. 
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Operation of the message processor 6 of source device 1 will now be described in more 
detail. The message processor generates a data message from input data using first and second 
coding systems. The first coding system is also used by transcoder 2 to generate the transcoded 
message, and both of the first and second coding systems are employed in destination device 4 to 
5 authenticate a received transcoded message, the appropriate coding systems being 

preprogrammed in transcode controller 9 and control unit 12 in this embodiment. Prior to the 
coding stages, the message processor 6 first divides the input data into a number of data fields. 
The set of data fields form one portion of the data message to be sent to the transcoder. Division 
of the data into fields can be carried out in various ways, for example one field per word, 

10 sentence, paragraph or other specified data quantity, and may depend on the nature of the data 
content as appropriate. Since the transcoder 2 must be able to identify the individual data fields, 
field boundaries may be specifically indicated in the data message, for example by inclusion of 
field markers identifying the field boundaries. Alternatively, the transcoder may be 
preprogrammed with the field division system used by the message processor 6. Either way, 

15 once the fields have been defined, the message processor 6 then processes the fields to generate 
an authentication code to be included in the data message. This is a two-stage process employing 
both the first and second coding systems mentioned above. In the first stage, the first coding 
system is employed to generate a data field code for each data field and then to derive a message 
code from the data field codes. In the second stage, the resulting message code is encoded using 

20 the second coding system to generate the authentication code. In the present embodiment, a 
hashing algorithm is employed as the first coding system and a signing function is employed as 
the second coding system. Operation of the hashing algorithm is illustrated in Figure 2. 

Figure 2 illustrates successive steps of the hashing algorithm for the case where ten data 
fields f 1 to f 10 are included in the data message. In the first step, a hash function H is applied to 

25 each field f to generate a hash value H(f) as the data field code for that field. Thus, a set of ten 
field hash values, indicated as hi to hlO, are generated from the ten fields. A hash tree is then 
computed over the set of field hash values hi to hlO whereby the hash function H is applied 
again to groups of the field hash values to generate group hash values corresponding to parent 
nodes of the hash tree, the hash function H then being applied iteratively to groups of the parent 

30 nodes, and so on until a single "root hash value" is obtained. In the particular example illustrated 
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in the figure, the hash function H is applied to pairs of neighboring field hash values hi to hlO to 
generate group hash values hl2, h34, h56, h78 and h910 as indicated. The function H is 
reapplied to the first two pairs of these group hash values to generate group hash values hl234 
and h5678. The function H is then reapplied to hl234 and h5678 to obtain group hash value 

5 hl2345678- Finally, the function H is applied to group hash values hl2345678 and h910 to 
obtain root hash value hl-10 which constitutes the message code for the data message. 

After generation of the message code hl-10, the message processor 6 applies a signing 
function Sgn to the message code to generate the authentication code Sgn(hl-lO). The 
authentication code is then added to the original data fields to form the data message to be sent 

10 to the transcoder 2. Figure 3 illustrates an example of the resulting data message, consisting of 
the authentication code Sgn(hl-lO) and ten data fields f 1 to flO, with field markers employed to 
identify the field boundaries in this case. While this represents a simple example of a data 
message, it will be appreciated that the data message may be more complex in practice. For 
example, additional data may be included in the message, such as data relating to constraints to 

15 be applied to the transcoding process. Moreover, as will be apparent to those skilled in the art, 
addressing of messages to a particular transcoder and/or destination device will generally be 
handled by the transmission protocol employed by the system, eg. HTTP in the case of an 
Internet server. Such transmission protocols are not central to the present invention and need not 
be discussed here. 

20 Operation of the transcoder 2 on receipt of the data message will now be described with 

reference to the flow chart of Figure 4. This is essentially a two-stage process. In the first stage, 
the transcoder determines whether each field of the message should be maintained, modified or 
omitted, and calculates hash values as appropriate. In the second stage the transcoder determines 
whether the length of the transcoded message can be reduced by "compressing" hash values, ie. 

25 replacing the hash values for a succession of omitted fields by the group hash value for a parent 
node of the hash tree of Figure 2. The process begins at step 20 when the data message is 
received by transcode controller 9. In step 21 the received message is stored by transcode 
controller 9 in memory 10. In step 22, the transcode controller 9 then analyses the first field f to 
determine if this field should be omitted from the message sent to the destination device. If the 

30 field is not to be omitted, the process proceeds to step 23 in which the transcode controller 
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decides if the field should be modified in some way, eg. by altering the existing field content or 
replacing the field content by new data. If no modification is required, the existing field is to be 
maintained. In this case, operation proceeds to step 24 wherein the original field f, with added 
field marker, is stored in memory 10 as a first portion of the transcoded message. In this 
5 embodiment, the field marker added by the transcode controller includes data indicating the 
nature of the field content, ie. whether the field content represents a maintained, modified, or 
omitted field, to facilitate subsequent processing by the destination device. Thus the field marker 
added in step 24 here indicates that the field content is a maintained field. Operation then 
proceeds to step 25 wherein the transcode controller determines if there are any more fields of 

10 the received message to be considered. Assuming so, operation reverts to step 22 where the next 
field is considered for omission. 

If it is determined in step 22 that the current field should be omitted, then in step 26 the 
transcode controller calculates an omitted field code in the form of the field hash value H(f) for 
the current field. Then, in step 27, the hash value H(f) is stored in memory as the next field of 

15 the transcoded message, with an associated field marker indicating that the field content is a 
hash value for an omitted data field. From step 27, operation proceeds again to step 25 in which 
the transcode controller 9 determines if there is a further field to be considered, and if so the 
process reverts to step 22 for analysis of this next field. 

Returning to step 23, if it is decided here that the current field content should be 

20 modified to f , then operation proceeds to step 28. Here, the transcode controller calculates a 
modified field code in the form of the "delta hash value" Hd = H(f) - H(f ). Thus, the transcode 
controller applies the hash function H to both the original data field f and the modified data f to 
obtain the original field hash value H(f) and also a modified field hash value H(f ). The 
arithmetic difference between these two is then taken as the delta hash value Hd. In step 29, Hd 

25 and the modified data f are stored in memory 10, with an associated field marker, as the next 
field of the transcoded message. The field marker added here indicates that the field contains a 
delta hash value and modified data. Operation then proceeds to step 25, and reverts again to step 
22 if there is a further field to be processed. 

After all fields have been processed in this way so that no further fields are identified in 

30 step 25, the first stage of the transcoding process is complete. Operation then progresses to steps 
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30 to 32 which represent the second stage of the transcoding process. In step 30, the transcode 
controller analyzes the transcoded message stored in memory 10 to determine if it contains a 
series of field hash values which can be compressed. This is possible where there is a series of 
field hash values representing a group of omitted fields corresponding to parent node of the 
5 hash tree of Figure 2. In particular, if all the fields corresponding to any parent node have been 
omitted, then it is sufficient to send the group hash value for that parent node in the transcoded 
message. For example, referring to Figure 2, if fields f 1 and f2 of the received message have 
been omitted, then the individual field hash values hi and h2 in the transcoded message can be 
replaced by the group hash value hl2. Similarly, if fields f 1 to f4 have been omitted, then the 
10 field hash values hi to h4 can be replaced by the group hash value hl234. Thus, the transcode 
controller checks whether the transcoded message contains a group of field hash values which 
can be compressed in this way, and if so operation proceeds to step 31. Here, the appropriate 
series of fields in the transcoded message is replaced by a single field containing the parent node 
hash value, and the field marker added in this case indicates the number of omitted fields which 
15 that hash value represents. Operation then proceeds to step 32. 

In step 32, the authentication code Sgn(hl-lO) from the originally received message is 
added to the transcoded message fields stored in memory 10, and the resulting transcoded 
message is output to the transceiver circuitry 8 for transmission to the destination device 4. 
Reverting to step 30, if it is determined here that the message does not contain a group of field 
20 hash values which can be compressed as described, then operation proceeds directly to step 32. 

It will be seen from the above that, in constructing the final transcoded message which is 
sent to destination device 4, when it is decided to omit a field of the received message (but not 
all fields corresponding to a parent node in Figure 2), then the original field is replaced by the 
field hash value in the transcoded message. When a group of fields in the received message 
25 which correspond to a parent node in the hash tree are omitted, then that group of fields is 

replaced by the parent node hash value in the transcoded message. When a field of the received 
message is to be modified, that field is replaced by the modified data and the appropriate delta 
hash value in the transcoded message. When it is decided to maintain a field of the received 
message, that field is simply maintained in the transcoded message. As a simple example, 
30 consider receipt of a data message with the content "Do you wish to transfer $10 from checking 
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account A to savings account B?". If the received message contains fields f 1 to f 10 defined 
below, then the transcoder might generate a transcoded message with transcoded fields tl to t9 
as follows: 



Received Fields 


Transcoder Action 


Transcoded Fields 


f 1 = "Do you" 


omit 




f2 = "wish to" 


omit 


tl =hl2 


D = "transfer" 


maintain 


t2 = "transfer" 


f4 = "$10" 


maintain 


t3 = "$10" 


f5 = "from" 


maintain 


t4 = "from" 


f6 = "checking account" 


modify 


t5 = Hd"checking" 


f7 = "A" 


maintain 


t6 = "A" 


f8 = "to" 


maintain 


tl = "to" 


f9 = "savings account" 


modify 


t8 = Hd"savings" 


flO = "B?" 


maintain 


t9 = "B?" 



15 In this example, it is assumed that the hashing algorithm of Figure 2 is applied. Thus, since both 
fields f 1 and f2 corresponding to parent node hash value hl2 are omitted, transcoded field tl 
replaces received fields f 1 and f2. In transcoded fields t6 and t9, the delta hash value Hd is the 
difference between the received field hash value and the hash value for the modified data in each 
case. 

20 In the above embodiment, the message received from source device 1 is retained in 

transcoder memory 10, the transcoded message being generated as described by reference to the 
stored message. The original message is thus available in memory if required later as described 
further below. In other embodiments, however, the transcoder may produce the transcoded 
message by operating directly on the stored message received from source device 1, overwriting 

25 received fields during the transcoding process. 

Operation of the destination device control unit 12 will now be described with reference 
to the flow chart of Figure 5. The process begins at step 40 when the transcoded message is 
received. In step 41, the control unit stores the transcoded message in memory 13. In step 42, the 
control unit uses the content of the transcoded fields to obtain a set of definitive hash values for 

30 the original message sent by source device 1. The operations performed in this step depend on 
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whether a given transcoded field corresponds to a maintained, omitted or modified field. This 
may be identified by the control unit from the field content, an omitted field being indicated by a 
hash value, a modified field being indicated by a delta hash value plus ordinary data content, and 
data content alone signifying a maintained field. In the present embodiment, however, the nature 

5 of the field content is indicated by the field markers as described above. For maintained fields, 
the control unit applies the hash function H to the data content f to calculate the corresponding 
field hash values H(f). For a modified field, the control unit applies the hash function H to the 
data content f to obtain the modified field hash value H(f ), and then adds this to the delta hash 
value Hd to obtain the original field hash value H(f). For omitted fields, the required hash values 

10 are provided by the transcoded fields themselves. 

Following step 42, operation proceeds to step 43 wherein the control unit 12 calculates a 
received message code by applying the hash function H to the set of field/group hash values 
obtained in step 42 according to the tree structure of the hashing algorithm. In step 44 the control 
unit then compares the received message code with the original message code hl-10 in the 

15 authentication code Sgn(hl- 10) of the transcoded message. The original message code is 
obtained here by decrypting Sgn(hl-lO) using the appropriate public key, thus verifying the 
signature. In step 45 the control unit determines whether the received message code is identical 
to the original message code hl-10. If the codes do not match then the authentication process has 
failed and it can be assumed that the original message has been tampered with and should not be 

20 relied upon. In this case, in step 46 the control unit supplies an appropriate user message to 

display 14 for display to the user. For example, the displayed message may comprise the relevant 
content of the received transcoded message and a warning that the message failed the 

authentication process. 

Returning to step 45, assuming that the codes are determined to match here, then in step 

25 47 the control unit supplies the relevant content of the transcoded message, ie the message 

content of the maintained and modified fields, to the display 14. The displayed message can also 
indicate that the message has been authenticated, and preferably also indicates where fields have 
been modified or omitted as compared with the original message. As an example, for the 
transcoded message discussed above, the displayed message might read: 

30 Authenticated message received: 
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[dropped fields] transfer $10 from checking[modified field] A to savings [modified 
! field] B? 

^ The fact that the message has been authenticated assures the user that the document has 

\ not been altered beyond the omission and modification of fields during transcoding as indicated 

\ 5 in the display. Moreover, the transcoding process must have been performed using the hashing 

J algorithm as described above to generate the correct codes for omitted and modified fields. In 

addition, the system may allow the user to retrieve selected modified or omitted fields in case of 
doubt over whether the semantic content of a message has been altered in the transcoding 
process. In the present embodiment, for example, the user can select a displayed transcode 

i' 

> 10 indicator, ie "[dropped field]", "[modified field]" etc. in the displayed message using input 

means 15. On receipt of such an input, the control unit 12 transmits a request for the original 

K field to transcoder 2 via transceiver circuitry 1 1 . This request identifies the particular field 

|. required, for example by a field number identifying the location of the required field in the 

transcoded message. However identified, the transcode controller retrieves the required field 

^ 15 from its memory 10, and forwards the field to the destination device 4 for display to the user. 

\ It will be seen from the above that a highly efficient system is provided allowing 

verification of transcoder operation through elegantly simple processing operations which are 

\ practical to implement in system devices. It will be appreciated, however, that while preferred 

k 
h 

£ embodiments of the invention have been described in detail above, many changes and 

| 20 modifications may be made to the embodiments described without departing from the scope of 

| the invention. 

\ 
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CLAIMS 

1 . A method of transcoding a data message, comprising a plurality of data fields (f 1-f 10) 
and an authentication code (Sgn(hl-lO)), to produce a transcoded message for transmission to a 

5 destination device (4), the data message being received from a source device (1) wherein said 
data fields (f 1-f 10) have been coded in accordance with a first coding system, whereby 
respective data field codes (hl-hlO) are generated for said data fields (fl-flO) and a message 
code (hl-10) is derived from said data field codes (hl-hlO), and wherein said message code 
(hi -10) has been coded in accordance with a second coding system to generate said 
10 authentication code (Sgn(hl- 10)), the method comprising: 

determining for each data field (f 1-f 10) of the received data message whether to 
maintain, modify or omit that field; 

for a field to be maintained, maintaining that field in said transcoded message; 
for a field to be omitted, coding the field in accordance with said first coding system to 
15 generate an omitted field code dependent upon the data field code (h) for that field, and 
replacing that field by said omitted field code in the transcoded message; 

for a field to be modified, coding that field in accordance with said first coding system to 
generate a modified field code dependent upon the data field code (h) for that field, and 
replacing that field by a modified field, comprising modified data (f ) and said modified field 
20 code, in the transcoded message; and 

including said authentication code (Sgn(hl-lO)) in the transcoded message, 

2. A method as claimed in claim 1 wherein, for at least some instances of a field to be 
omitted, the omitted field code comprises the data field code (h) for that field. 



25 



3. A method as claimed in claim 2 wherein said message code (hl-10) has been derived by 
coding the data field codes (hl-hlO) for predetermined groups of fields to generate respective 
group codes, and wherein, for a field to be omitted: 
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if all fields in the corresponding group are to be omitted, then the omitted field code 
comprises the group code for that group, and the group of fields is replaced by said group code 
in the transcoded message; 

if less than all fields in the corresponding group are to be omitted, then the omitted field 
5 code comprises the data field code (h) for that field. 

4. A method as claimed in any preceding claim wherein, for a field to be modified, the 
modified field code is generated by generating the data field code (h) for that field and coding 
said modified data (f ) in accordance with said first coding system to generate a modified data 

10 code (H(f )), the modified field code being representative of the difference between that data 
field code (h) and modified data code (H(f )). 

5. A method as claimed in any preceding claim including inserting markers in the 
transcoded message, each marker indicating whether a respective portion of the transcoded 

1 5 message corresponds to a maintained, modified or omitted field of the received data message. 

6. A method as claimed in any preceding claim wherein said first coding system is a 
hashing algorithm whereby said data field codes (hl-hlO) are hash values. 

20 7. A method as claimed in claim 6 and claim 3 wherein said message code (hl-10) is the 
root hash value of a hash tree calculated from said data field codes (hl-hlO), and wherein said 
group codes are the hash values of respective parent nodes of said hash tree. 

8. A method of processing a data message, comprising a plurality of data fields (f 1-f 10) and 
25 an authentication code (Sgn(hl-lO)), received from a source device (1) wherein said data fields 
(fl-flO) have been coded in accordance with a first coding system, whereby respective data field 
codes (hl-hlO) are generated for said data fields (fl-flO) and a message code (hl-10) is derived 
from said data field codes (hl-hlO), and wherein said message code (hl-10) has been coded in 
accordance with a second coding system to generate said authentication code (Sgn(hl-lO)), the 
30 method comprising transcoding the received data message by a transcoding method as claimed 
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in any preceding claim, transmitting the transcoded message to the destination device (4), and, in 
said destination device (4): 

deriving a received message code from the transcoded message using maintained fields, 
modified fields and omitted field codes in said message in accordance with said first coding 
5 system; 

comparing the received message code with the message code (hi- 10) encoded in said 
authentication code (Sgn(hl-lO)) in accordance with said second coding system; and 

displaying a user message dependent upon the result of the message code comparison. 

10 9. A method as claimed in claim 8 wherein, at least if the received message code is 

identical to the message code (hi -10) encoded in said authentication code (Sgn(hl-lO)), the user 
message comprises the maintained data fields and modified data (F) from the transcoded 
message. 

15 10. A method as claimed in claim 9 wherein the user message includes transcode indicators 
indicative of the location in the displayed message of fields omitted or modified from the data 
message as sent by the source device. 



11. A method as claimed in claim 10 including: 

20 in the transcoding method, storing received data fields corresponding to omitted and 

modified fields; 

transmitting a stored field to the destination device (4) in response to a transcoded field 
request from the destination device (4); and 

at the destination device (4), displaying the stored field received pursuant to said request. 

25 

12. A transcoder (2) for transcoding a data message, comprising a plurality of data fields 
(fl-flO) and an authentication code (Sgn(hl-lO)), to produce a transcoded message for 
transmission to a destination device (4), the data message being received from a source device 
(1) wherein said data fields (fl-flO) have been coded in accordance with a first coding system, 

30 whereby respective data field codes (hl-hlO) are generated for said data fields (f 1-flO) and a 
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message code (hl-10) is derived from said data field codes (hl-hlO), and wherein said message 
code (hl-10) has been coded in accordance with a second coding system to generate said 
authentication code (Sgn(hl-lO)), the transcoder comprising: 

a memory (10) for storing the received data message; 
5 transcoder logic (9) configured to determine for each data field (f 1-f 10) of the received 

data message whether to maintain, modify or omit that field, and to produce the transcoded 
message from the received data message; and 

means (8) for transmitting the transcoded message to the destination device (4); 

wherein the transcoder logic (9) is configured to produce the transcoded message from 
10 the received data message by: 

for a field to be maintained, maintaining that field in said transcoded message; 

for a field to be omitted, coding the field in accordance with said first coding system to 
generate an omitted field code dependent upon the data field code (h) for that field, and 
replacing that field by said omitted field code in the transcoded message; 
15 for a field to be modified, coding that field in accordance with said first coding system to 

generate a modified field code dependent upon the data field code (h) for that field, and 
replacing that field by a modified field, comprising modified data (f ) and said modified field 
code, in the transcoded message; and 

including said authentication code (Sgn(hl-lO)) in the transcoded message. 

20 

13. A transcoder (2) as claimed in claim 12 wherein, for at least some instances of a field to 
be omitted, the omitted field code comprises the data field code (h) for that field. 

14. A transcoder (2) as claimed in claim 13 for transcoding a received data message for 
25 which said message code (hl-10) has been derived by coding the data field codes (hl-hlO) for 

predetermined groups of fields to generate respective group codes, wherein the transcoder logic 
(9) is configured such that, for a field to be omitted: 

if all fields in the corresponding group are to be omitted, then the omitted field code 
generated by the transcoder logic comprises the group code for that group, and the transcoder 
30 logic replaces that group of fields by said group code in the transcoded message; 
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if less than all fields in the corresponding group are to be omitted, then the omitted field 
code generated by the transcoder logic comprises the data field code (h) for that field. 

15. A transcoder (2) as claimed in any one of claims 12 to 14 wherein, for a field to be 
5 modified, the transcoder logic (9) is configured to generate the modified field code by generating 
the data field code (h) for that field and coding said modified data (f ) in accordance with said 
first coding system to generate a modified data code H(f), the modified field code being 
representative of the difference between that data field code (h) and modified data code H(F). 

10 16. A transcoder (2) as claimed in any one of claims 12 to 15 wherein the transcoder logic 
(9) is configured to insert markers in the transcoded message, each marker indicating whether a 
respective portion of the transcoded message corresponds to a maintained, modified or omitted 
field of the received data message. 

15 17. A transcoder (2) as claimed in any one of claims 12 to 16 wherein said first coding 
system is a hashing algorithm whereby said data field codes (hl-hlO) are hash values. 

18. A transcoder (2) as claimed in claim 17 and claim 14 wherein said message code (hl-10) 
is the root hash value of a hash tree calculated from said data field codes (hl-hiO), and wherein 

20 said group codes are the hash values of respective parent nodes of said hash tree. 

19. A transcoder (2) as claimed in any one of claims 12 to 18, wherein the transcoder logic 
(9) is further configured to output a stored field of the received data message to the transmitter 
means (8) for transmission to the destination device (4) in response to receipt of a transcoded 

25 field request from the destination device (4). 

20. A destination device (4) for receiving a transcoded message from a transcoder (2) as 
claimed in any one of claims 12 to 19, the device (4) comprising a memory (13) for storing a 
received transcoded message, a display (14), and control logic (12) configured to: 
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derive a received message code from the transcoded message using maintained fields, 
modified fields and omitted field codes in said message in accordance with said first coding 
system; 

compare the received message code with the message code (hi- 10) encoded in said 
5 authentication code (Sgn(hl-lO)) in accordance with said second coding system; and 

to supply a user message, dependent upon the result of the message code comparison, to 
the display (14) for display to a user. 

21. A device (4) as claimed in claim 20 wherein, at least if the received message code is 

10 identical to the message code (hl-10) encoded in said authentication code (Sgn(hl-lO)), the user 
message comprises the maintained data fields and modified data (f ) from the transcoded 
message. 

22. A device (4) as claimed in claim 21 wherein the user message includes transcode 

15 indicators indicative of the location in the displayed message of fields omitted or modified from 
the data message as sent by the source device (1). 

23. A device (4) as claimed in any one of claims 20 to 22 for receiving a transcoded message 
from a transcoder (2) as claimed in claim 19, the device (4) including user input means (15) and 

20 means (1 1) for transmitting a said transcoded field request to the transcoder (2), wherein the 
control logic (12) is configured to generate the transcoded field request in response to a user 
input via said input means (15). 

24. A data delivery system comprising a transcoder (2) as claimed in any one of claims 12 to 
25 19 and one or more destination devices (4) as claimed in any one of claims 20 to 23. 

25. A data communication system comprising a data delivery system as claimed in claim 24, 
and a source device (1) for generating a data message for transmission to a said destination 
device (4), the source device (1) comprising message processing logic (6) configured to: 

30 divide data to be included in the data message into a plurality of data fields (f 1-flO); 
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code said data fields (fl-flO) in accordance with said first coding system whereby 
respective data field codes (hl-hlO) are generated for said data fields (fl-flO) and a message 
code (hl-10) is derived from said data field codes (hl-hlO); 

and to code said message code (hl-10) in accordance with said second coding system to 
5 generate an authentication code (Sgn(hl-lO)) for the message; 

the source device (1) including means (7) for transmitting a data message, comprising 
said plurality of data fields (fl-flO) and said authentication code (Sgn(hl-lO)), to the transcoder 
(2) of said data delivery system. 
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ABSTRACT 



TRANSCODING IN DATA COMMUNICATIONS 



Methods and apparatus are provided for transcoding a data message, comprising a 
5 plurality of data fields (fl-flO) and an authentication code (Sgn(hl-lO)), to produce a transcoded 
message for transmission to a destination device (4). The transcoding methods can be applied to 



such a data message which is received from a source device (1) wherein said data fields (fl-flO) 
have been coded in accordance with a first coding system, whereby respective data field codes 
(hl-hlO) are generated for said data fields (fl-flO) and a message code (hl-10) is derived from 

10 said data field codes (hl-hlO), and wherein said message code (hl-10) has been coded in 

accordance with a second coding system to generate said authentication code (Sgn(hl-lO)). For 
each data field (f 1-f 10) of the received data message it is decided whether to maintain, modify or 
omit that field. For a field to be maintained, that field is maintained in the transcoded message. 
For a field to be omitted, that field is coded in accordance with said first coding system to 

1 5 generate an omitted field code dependent upon the data field code (h) for that field, and that field 
is replaced by said omitted field code in the transcoded message. For a field to be modified, that 
field is coded in accordance with said first coding system to generate a modified field code 
dependent upon the data field code (h) for that field, and that field is replaced by a modified 
field, comprising modified data (f ) and said modified field code, in the transcoded message. 

20 The received authentication code (Sgn(hl-lO)) is also included in the transcoded message. 
Sufficient information is thereby included in the transcoded message to enable the destination 
device to verify the transcoding operation. 
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